Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[OPIK-590] Online Scoring Automation Rules endpoints #945

Merged
merged 19 commits into from
Jan 3, 2025

Conversation

ldaugusto
Copy link
Contributor

@ldaugusto ldaugusto commented Dec 20, 2024

Details

Data model, DAO, service layer and endpoints for Automation Rule Evaluators.

We're using a polymorphic type for the evaluator code, which at least for now for LLM-as-Judge, we will use a JsonNode type

Issues

OPIK-590
OPIK-591

Testing

Documentation

Data model, DAO, service layer and endpoints for Automation Rule Evaluators.

I've started with a more complicated approach, but in the end decided for a much simpler one which hopefully is easy enough to extend later into non-evaluator automation rules. Also as we've decided for a non-validation approach in the code, we're just going with a text field.

## Issues
OPIK-590
OPIK-591
@ldaugusto ldaugusto requested a review from andrescrz December 20, 2024 13:57
@ldaugusto ldaugusto requested a review from a team as a code owner December 20, 2024 13:57
Copy link
Collaborator

@andrescrz andrescrz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As discussed in the design, the rule, action and evaluator can be up to 3 different entities. Fine to go with an approach where they're all collapsed into 1 in order to deliver faster. However, this can make extendability for future requirements harder.

For this PR, focus on making the evaluator payload a polymorphic value, so we can define any type of evaluator (python, llm as a judge). The current code field of type string is a concerning implementation.

Then review a bit the paths of the new resource and adjust them.

Everything else is minor polishings. I didn't go in the tests deeply, as still commented out and because this is in-progress.

Thank you very much for putting this together and congrats for your first PR in Opik!

return new AutomationRuleEvaluator.AutomationRuleEvaluatorPage(page, 0, 0, List.of());
}
}
}
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Minor: run spotless on your PRs, so code is automatically formatted.

This is a previous miss from our side, we need to automate this.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

New line in the end would be nice.

Copy link
Collaborator

@andrescrz andrescrz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is in better state, but let's keep polishing.

Copy link
Contributor

@BorisTkachenko BorisTkachenko left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Overall looks good, left some comments



@Override
public void update(@NonNull UUID id, @NonNull UUID projectId, @NonNull String workspaceId,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For consistency it would be better to add logs to all methods. Or remove from all, if we could use logs from Resource class.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

adding log

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

save and delete methods do not have logging

return new AutomationRuleEvaluator.AutomationRuleEvaluatorPage(page, 0, 0, List.of());
}
}
}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

New line in the end would be nice.



@Override
public void update(@NonNull UUID id, @NonNull UUID projectId, @NonNull String workspaceId,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

save and delete methods do not have logging

Comment on lines +115 to +126
private UUID create(AutomationRuleEvaluator<?> evaluator, String apiKey, String workspaceName) {
try (var actualResponse = client.target(URL_TEMPLATE.formatted(baseURI, evaluator.getProjectId()))
.request()
.accept(MediaType.APPLICATION_JSON_TYPE)
.header(HttpHeaders.AUTHORIZATION, apiKey)
.header(WORKSPACE_HEADER, workspaceName)
.post(Entity.json(evaluator))) {

assertThat(actualResponse.getStatusInfo().getStatusCode()).isEqualTo(201);

return TestUtils.getIdFromLocation(actualResponse.getLocation());
}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please move it to test client (ex. TraceResourceClient), in case we need it in other tests. Now we have a lot of code duplication in tests, due to making such same calls in each test class.

Copy link
Contributor

@BorisTkachenko BorisTkachenko left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good

@ldaugusto ldaugusto merged commit 7f5175d into main Jan 3, 2025
7 checks passed
@ldaugusto ldaugusto deleted the daniel/automation-rule-evaluator-endpoints branch January 3, 2025 11:01
@ldaugusto ldaugusto changed the title [Online Scoring] Automation Rules endpoints [OPIK-590] Online Scoring Automation Rules endpoints Jan 8, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants